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background: The controversies concerning possible overtreatment of prostate cancer, highlighted by debate over PSA screening, 
have highlighted active surveillance (AS) as an alternative management option for appropriate men. Regional differences in the 
underlying prevalence of PSA testing may alter the pre-test probability for high-risk disease, which can potentially interfere with the 
performance of selection criteria for AS. In a multicentre study from three different countries, we examine men who were initially 
suitable for AS according to the Toronto and Prostate Cancer Research International: Active Surveillance (PRIAS) criteria, that 
underwent radical prostatectomy (RP) in regards to: 

1. the proportion of pathological reclassification(Gleason score ^7, ^pT3 disease), 

2. predictors of high-risk disease, 

3. create a predictive model to assist with selection of men suitable for AS. 

methods: From three centres in the United Kingdom, Canada and Australia, data on men who underwent RP were retrospectively 
reviewed (n = 2329). Multivariable logistic regression was performed to identify predictors of high-risk disease. A nomogram was 
generated by logistic regression analysis, and performance characterised by receiver operating characteristic curves. 
results: For men suitable for AS according to the Toronto (n = 800) and PRIAS (410) criteria, the rates for upgrading were 50.6, 
42.7%, and upstaging 1 7.6, 1 2.4%, respectively. Significant predictors of high-risk disease were: 

• Toronto criteria: increasing age, cT2 disease, centre of diagnosis and number of positive cores. 

• PRIAS criteria: increasing PSA and cT2 disease. 

Cambridge had a high pT3a rate (26 vs 1 2%). To assist selection of men in the United Kingdom for AS, from the Cambridge data, we 
generated a nomogram predicting high-risk features in patients who meet the Toronto criteria (AUC of 0.72). 
conclusion: The proportion of pathological reclassification in our cohort was higher than previously reported. Care must be used 
when applying the AS criteria generated from one population to another. With more stringent selection criteria, there is less 
reclassification but also fewer men who may benefit from AS. 
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Treatment paradigms for small-volume, low-grade prostate cancer 
(PCa) are currently moving away from radical treatments. The recent 
controversial US preventative task force report (Lin et al, 2011) giving 
PSA screening a 'D' rating, prompted by the results from the PLCO and 
ERSP trials, has highlighted the issues associated with overdiagnosis 
and overtreatment of PCa. In follow-up of over 3,500 patients across 
seven active surveillance (AS) case series, the cancer-specific survival 
for the cohort is 99.7% (Cooperberg et al, 201 1), though median follow- 
up remains relatively short (2-7 years). Active surveillance has 
emerged as a viable management option and should be offered to 
patients with low-risk cancer (National Institute for Health and Clinical 
Excellence, 2008; National Institute of Health, 2011). 
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There are many different inclusion criteria for AS published in 
the literature. While all are variations on the model developed by 
Epstein et al (1994), the discrepancies between them reflect the 
uncertainty in appropriate cutoffs to distinguish indolent from 
high-risk cancer. In addition, regional differences in the under- 
lying prevalence of PSA testing in the community alters the pre- 
test probability for high-risk disease (Moore et al, 2009) in defined 
'low-risk' cohorts, which potentially interferes with the perfor- 
mance of selection rules when applied to populations distinct from 
which they were generated. 

In our study, we examine application of AS selection rules to a 
combined Australian, British and Canadian group of patients who 
underwent radical prostatectomy (RP). Our primary objective was 
to document the proportion of pathological reclassification from 
prostate biopsy to RP. Secondary aims included analysis for 
predictors of high-risk disease and creation of a predictive model 
to assist with selection of men suitable for AS. 
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MATERIALS AND METHODS 

Pooled prospectively collected data from Addenbrooke's Hospital, 
Cambridge, United Kingdom (2005-2010); The Vancouver 
Prostate Centre, Canada (1995-2010); and the Australian Prostate 
Cancer Centre at Epworth, Melbourne, Australia (2003-2010) were 
retrospectively analysed. All patients had their RP specimens 
discussed at centralised multidisciplinary meetings and evaluated 
by dedicated genitourinary pathologists. Cambridge and Vancouver 
had routine centralised review of biopsies. Ethics approval in all three 
centres covers the use of collected clinical information for prognostic 
studies. 

A summary of the literature review for published inclusion 
criteria used for AS is shown in Table 1 (Hardie et al, 2005; 
Warlick et al, 2006; Dall'Era et al, 2008; Klotz et al, 2009; van den 
Bergh et al, 2009; Soloway et al, 2010; Adamy et al, 2011). In 
selecting which AS criteria to apply to our series, we used the 
University of Toronto criteria, described by Klotz et al (2009), 
from the first protocol-based AS prospective study and those 
published from the Prostate Cancer Research International: Active 
Surveillance (PRIAS) originating from the ERSPC (van den Bergh 
et al, 2007). Our cohort did not contain data for the amount of 
cancer present in a biopsy core (length or percentage) as only 
Cambridge consistently recorded this in their database so the 
PRIAS criteria were the strictest applicable to our data set. 

Patients treated by means of RP who had preoperative 
parameters appropriate for inclusion for AS per these criteria 
had final pathology analysed for reclassification rates of upstaging, 
defined as ^pT3, or upgrading, defined as Gleason sum 7-10. 
Gleason 7 disease was subdivided into 3 + 4 and 4 + 3 groups. 
High-risk disease was defined as ^pT3 and/or Gleason sum ^8. 
CAPRA-S scores, a validated postsurgical score to predict PCa 
recurrence using pre-treatment and pathological data, for risk 
stratification were also calculated (Cooperberg et al, 2011). Lymph 
node involvement was not analysed, as across all centres there was 
no consistent policy regarding the performance of a pelvic lymph 
node dissection in patients with low-risk disease, and hence data 
collection was poor. 

Differences between groups of continuous variables were 
determined by Mann-Whitney U or Kruskal-Wallis ANOVA. 
Pearson's j 1 or Fisher's exact test was used to determine 
differences between groups of categorical variables. To identify 
predictors of high-risk disease in patients selected for AS, logistic 
regression models were fitted, including parameters age, PSA, 
PSAD, clinical stage, number of biopsy cores taken, number of 
positive biopsy cores and centre of treatment as individual terms. 
Statistical analyses were performed using SPSS version 18.0 (IBM 
Corporation, Armonk, NY, USA), and all tests were two-sided with 
significance assumed at P<0.05 unless otherwise stated. 

To generate a clinically usable tool that predicted the presence of 
high-risk disease in the Cambridge cohort that met the Toronto 
criteria, all factors found to be significant in multivariate analysis 



were considered and the most parsimonious model generated from 
these. Patients were randomly assigned 70:30 to a learning and 
evaluation cohort. Logistic regression analysis was then performed 
using all available potential predictors of high-risk disease in the 
learning cohort, and a nomogram of the resulting equation 
generated using Orange (http://orange.biolab.si V2.0b, accessed 5 
August 2011). The discriminative ability of our nomogram to 
predict high-risk disease was characterised by generating receiver 
operating characteristic curves based on the predicted probabil- 
ities of the evaluation cohort. A Loess calibration plot was used to 
assess the performance of our model across the entire range of 
predicted values, as the tool may have excellent overall accuracy 
but may not perform well in a specific range of predicted 
probabilities. A two-component (calibration and AUC calculation) 
decomposition Brier score was calculated, with a lower Brier score 
indicating better discriminant properties (Eng, 2006). 



RESULTS 

Of the 2329 patients that had RP, 800 patients met the Toronto 
criteria for AS, and this number was reduced to 410 patients when 
the stricter PRIAS criteria were applied. The preoperative 
characteristics for these two groups are shown, overall and by 
treatment centre, in Tables 2a and 2b. The pathology results from 
RP, including proportions reclassified and final risk group, also 
divided by treatment centre, are shown in Tables 3a and b. Overall, 
for those satisfying the Toronto criteria, 50.6% were upgraded to 
GS ^7 and 17.6% upstaged to pT3/4. The reclassification rates for 
the PRIAS group were 40.5% and 12.4%, respectively. For both 
groups, the majority of GS upgrading consisted of 3 + 4 disease 
(Toronto = 84% and PRIAS = 79%). In Cambridge, there was a 
relatively high rate of pT3a disease in the Toronto criteria group 
(26%) that decreased with PRIAS criteria (14%). Melbourne had a 
relatively high rate of Gleason 7 disease in final pathology for both 
the Toronto and PRIAS criteria groups (61 and 52%). 

Given that study periods at different centres were different, to 
account for changes in biopsy technique, pathological interpreta- 
tion and treatment patterns with time, we repeated the analysis 
in a restricted cohort (year ^2003 and total number of cores 
taken at biopsy ^8) more reflective of contemporary practice. 
Results of this subanalysis for the Toronto and PRIAS inclusion 
criteria are shown in Table 4 and were similar to the initial overall 
analysis with similar proportions of GS upgrading (49.4% Toronto, 
41% PRIAS) and pT3/4 upstaging (Toronto 17.9% and PRIAS 
11.3%). 

Using standard reported clinico-pathological variables, we were 
unable to derive a more accurate model that predicted indolent 
disease than the PRIAS criteria. However, we were able to identify 
significant predictors for high-risk disease (Table 5a) by multi- 
variate logistic regression analysis. For patients meeting the 
Toronto criteria, increasing age, number of positive cores, the 



Table I Published criteria for active surveillance 





Principal 


Clinical 


PSA 


Biopsy Gleason 


PSA density 


No. of +ve 


% of single 


Institution 


investigator 


stage 


(ngml 1 ) 


score 


(ngml ~ 1 ml ~ ') 


cores 


core 


Royal Marsden 


Parker 


TI-T2 


< 15 


^3 + 4 




< 50% total cores 




University of Toronto 


Klotz 


Tlc/T2a 


< 10 


^6 








PRIAS 


Shroder 


Tlc/T2 


< 10 


<6 


<0.20 


^2 




UCSF 


Carroll 


TI-T2 


< 10 


<6 




< 1 /3 of total cores 


^50% 


MSK 


Eastham 


Tl-T2a 


^ 10 


^6 




^3 


^50% 


University of Miami 


Soloway 


^T2 


< 10 


^6 




<2 


^20% 


John Hopkins Medical Institution 


Carter 


Tic 




^3, no pattern Gleason 


<0.I5 




^50% 










score 4 or 5 









Abbreviations: MSK = Memorial Sloan Kettering; PRIAS = Prostate Cancer Research International: Active Surveillance; PSA = prostate-specific antigen; UCSF = University of 
California, San Francisco. 
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Table 2a Preoperative characteristics for patients suitable for AS according to the Toronto criteria - by centre 



Total 



Cambridge 



Melbourne 



1469 



Vancouver 



Years of data 
N (total RP) 
N (total AS) 


1995-2010 

2329 
800 (34.3%) 


2005-2010 

700 
267 (40%) 


2003-2010 

790 
187 (31%) 


1995-2010 

839 
190 (32%) 




Age (years) 
Median (IQR) 


61 (56.7- 65) 


61 (39 -73) 


60 (42 - 74) 


61 (43-79) 




PSA (ngmr 1 ) 
Median (IQR) 












5.8 (4.7 - 7.4) 


6.4 (0.5- 10) 


5.5 (0.3- 10) 


5.5 (0.5- 10) 




Clinical stage 
cTI 
cT2a 


570 (71.3%) 
230 (28.7%) 


206 (77%) 
59 (23%) 


149 (80%) 
38 (20%) 


109 (58%) 
80 (42%) 


'■5 

3 
(/) 


PSA density* 
Median (IQR) 


0.1 (0.086-0.151) 


0.108 (0.01 1 -0.816) 


0.1 14 (0.014-0.315) 


0.1 1 (0.012-0.371) 


"5 


Biopsy cores taken 
Median (range) 


1 0 (2 - 30) 


1 2 (4 - 30) 


1 1 (4 - 30) 


8 (2 - 1 3) 


u 


Number of positive cores 
Median (IQR) 


2(1-4) 


2(1-14) 


3 (1 - 12) 


2 (1 -8) 




Table 2b Preoperative characteristics for patients suitable for AS according to the PRJAS criteria 


- by centre 








Total 


Cambridge 


Melbourne 


Vancouver 




N (total RP) 
N (total AS) 


2329 
410 (18%) 


700 
134 (19%) 


790 
1 14 (14%) 


839 
162 (19%) 




Age (years) 
Median (IQR) 


60.5 (56.3-65) 


62 (45-70) 


59 (48-73) 


62 (43-73) 




PSA (ngmr 1 ) 
Median (IQR) 


5.6 (4.3-7) 


6.3 (2.7-10) 


5.6 (0.3-10) 


5.4 (1.5-0.4) 




Clinical stage 
cTI 
cT2a 


287 (70%) 
1 23 (30%) 


109 (81%) 
25 (19%) 


91 (80%) 
23 (20%) 


87 (54%) 
75 (46%) 




PSA density* 
Median (IQR) 


0.1 (0.071-0.13) 


0.99 (0.02-0.19) 


0.1 (0.01-0.19) 


0.1(0.01-0.2) 




Biopsy cores taken 
Median (range) 


9 (2-24 - IQR) 


1 2 (6-24) 


10 (3-17) 


8 (2-12) 




Number of positive cores 
Median (IQR) 


1 


1 (1-2) 


2(1-2) 


2(1-2) 




Abbreviations: AS = active 
RP = radical prostatectomy. 


surveillance; IQR = interquartile range; PRIAS = Prostate Cancer Research International: Active Surveillance; 
a Not part of the original Klotz criteria but reported for comparison with Van den Bergh. 


PSA = prostate-specific antigen; 





presence of palpable disease as well as the centre of diagnosis were 
all significant predictors of the presence of high-risk disease; 
whereas for patients meeting the more stringent PRIAS criteria, 
only increasing PSA and the presence of palpable disease were 
significant. Total number of cores taken was analysed and not 
predictive. 

A number of subgroup analyses were undertaken to search for 
predictors of more advanced disease. Examining for cancer present 
in a biopsy core (percentage) as a predictor in only the Cambridge 
cohort did not yield a significant result. Repeating multivariate 
logistic regression analyses, limited by year of surgery (^2003) 
and number of biopsy cores taken (^8) for high-risk disease 
( ^ pT3 or Gleason sum ^ 8) or ^ pT3 alone, demonstrated similar 
predictors to those found for the entire cohort. There were no 



significant predictors of primary Gleason pattern 4 (as opposed to 
Gleason sum 8) identified. 

We were unable to generate a nomogram from the whole cohort 
data because of difficulties modelling sampling error. To account 
for sampling error, we used an index of prostate size to number of 
cores taken; however, this did not significantly improve the 
performance of the model. As the rate of pT3a was high in the 
Cambridge data, we performed logistic regression analysis to 
identify predictors of ^pT3a/GS^8 disease specifically for the 
Cambridge cohort alone (Table 5b). When only Cambridge data 
were analysed for the nomogram, PSAD was more predictive than 
PSA. From this analysis, a nomogram (Figure la) was generated 
that predicts individual risk of high-risk disease in UK men who 
meet the Toronto criteria, based on age, PSA density, number of 
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Table 3a Pathological results, with upgrading and upstaging rates 
highlighted, from radical prostatectomy for patients suitable for the 
Toronto AS criteria 





Total 
(Toronto) 


Cambridge 


Melbourne 


Vancouver 


P 


n 


800 


280 


248 


272 




Pathological GS 

^ 6 
Upgrading 

1 

3 + 4 

4 + 3 

8-10 


395 (47.8%) 

-inn /Ad / o/\ 

389 (48.6%) 
340 (87.4%) 
49 ( 1 2.6%) 
16(2%) 


157 (56%) 

1 1 1 f A ~\Q/\ 

1 17 (42%) 
1 08 (92%) 
9 (8%) 
6 (2%) 


94 (38%) 

i r i / / i o/\ 

151 (61%) 
1 33 (88%) 
18(12%) 
3 (1%) 


144 (53%) 

1 "1 1 / A A 0/\ 

121 (44%) 
99 (82%) 
22 ( 1 8%) 
7 (3%) 


< 0.001 
0.049 


pT 

P T2 


659 (82.4%) 


205 (73%) 


216 (87%) 


238 (88%) 


< 0.001 


Upstaging 
pT3/4 
EPE 
SVI 


141 (17.6%) 
136 (17%) 
8(1%) 


75 (27%) 
73 (26%) 
2(1%) 


32 (13%) 
30 ( 1 2%) 
3 (1%) 


34 ( 1 2%) 
32(12%) 
3 (1%) 




Margins 
Negative 
Positive 


/ 1 r /n a a o/\ 

675 (84.4%) 
125 (15.6%) 


~i~)n /oro/\ 

239 (85%) 
41 (15%) 


"\ i ~i /n no/\ 

217 (88%) 
31 (12%) 


219 (81%) 
53 (19%) 


0.077 


Percent tumour 
(n = 7l6) 

Median 

Range 


5 

3-15 


5 

4-10 


3 

1.5-9 


1 0 

5-20 


<0.00 1 


Final risk group 0 
Low 

Intermediate 
High 


362 (45.3%) 
286 (35.8%) 
152 (19%) 


1 35 (48%) 
65 (23%) 
80 (29%) 


91 (37%) 
1 23 (50%) 
13 (13%) 


1 36 (50%) 
98 (36%) 
38 (14%) 


< 0.001 


CAPRA-S risk groups 
0-2 
3-5 
^6 


629 (78.6%) 
1 65 (20.6%) 
6 (0.8%) 


208 (77.9%) 
58 (21.7%) 
1 (0.4%) 


152 (82.3%) 
34 (18.2%) 
1 (0.5%) 


143 (75.3%) 
44 (23.2%) 
3 (1.5%) 


0.42 



positive biopsy cores, and the presence or absence of palpable 
disease. The logistic regression equation for the nomogram is also 
included to facilitate future validation studies (Figure lb). 

Assessment of the nomogram to predict the presence of high- 
risk disease in a randomly selected evaluation cohort revealed 
reasonable accuracy (AUC of 0.72) (Figure 2). A calibration plot, 
comparing nomogram-predicted probabilities to actual propor- 
tions of high-risk disease, is shown in Figure 3. This shows that our 
nomogram underestimates the observed probability of high-risk 
disease. The two-component (calibration and AUC calculation) 
decomposition Brier score was 0.199. External validation for the 
nomogram generated from the Cambridge data, using Melbourne 
and Vancouver data, showed an AUC of 0.68 and 0.55, respectively. 



DISCUSSION 

It is clear that published inclusion criteria for AS (Table 1) vary in 
their stringency. The criteria at John Hopkins Medical Institute are 
the strictest but some centres elsewhere have accepted Gleason 7 
(usually 3 + 4), PSA levels up to 15ngml _1 and all clinical T2 
disease. Furthermore, not all centres use PSAD ( ^ 0.15-0.20), 
number of positive cores (^2, ^3 or 1/3 of total) and percentage 
(j^ 20-50%) of single core involvement to enrol patients. It has 
been demonstrated previously (Suardi et al, 2008; Conti et al, 2009; 
Mufarrij et al, 2010) that increases in stringency can decrease the 
rates of adverse pathological features but will also substantially 
decrease the number of men suitable for AS. Until the ProtecT trial 
reports (Lane et al, 2010), we will not know the criteria that predict 
the most important outcome of AS, namely death from PCa. 

Our paper reports combined results from the Australian, British 
and Canadian academic centres. These results are compared with 



Table 3b Pathological results, with upgrading and upstaging rates 
highlighted, from radical prostatectomy for patients suitable for the PRJAS 
criteria 





Total 
(PRIAS) 


Cambridge 


Melbourne 


Vancouver 


p 


n 


410 


134 


1 14 


162 




Pathological 
GS 

^6 
Upgrading 
7 

34-4 
4 + 3 
8-10 


235 (57.3%) 

1 66 (40.5%) 
138 (83.1%) 
28 ( 1 6.9%) 
9 (2.2%) 


87 (65%) 

44 (33%) 
40 (9 1 %) 
4 (9%) 
3 (2%) 


53 (46%) 

59 (52%) 
52 (88%) 
7(12%) 
2 (2%) 


95 (58%) 

63 (39%) 
46 (73%) 
17(27%) 
4 (3%) 


0.005 
0.023 


pT 

P T2 


359 (87.6%) 


1 15 (86%) 


104 (91%) 


140 (86%) 


0.43 


Upstaging 
pT3/4 
EPE 
SVI 


51 (12.4%) 
48 ( 1 1 .7%) 
4 (1%) 


19 (14%) 
19 (14%) 
0 (0%) 


10 (9%) 
9 (8%) 
1 (l°/o) 


22 (14%) 
20 (12%) 
3 (2%) 


0.27 


Margins 
Negative 
Positive 


*"> / A /DO 00/\ 

364 (88.8%) 
46 ( 1 1 .2%) 


i /n i o/\ 

1 22 (91 %) 
12(9%) 


i r\~> /r\r\o/\ 

103 (90%) 
1 1 (10%) 


1 39 (86%) 
23 (14%) 


0.3 


Percent tumour 
(n = 367) 

Median 

Range 


5 

2-10 


5 

0.2-80 


2 

0.3-37 


1 0 

1-70 


<0.00 1 


Final risk group 0 
Low 

Intermediate 
High 


219 (53.4%) 
1 33 (32.4%) 
53 (14.1%) 


79 (59%) 
33 (25%) 
22 (16%) 


52 (46%) 
51 (45%) 
1 1 (9%) 


88 (54%) 
49 (30%) 
25 (16%) 


0.014 


CAPRA-S risk groups 
0-2 (low) 
3-5 
^6 


346 (84.4%) 
60 (14.6%) 
4(1%) 


1 12 (89.6%) 
12 (9.6%) 
1 (0.8%) 


76 (87%) 
10 (12%) 
1 (l°/o) 


77 (79%) 
20 (20%) 

1 (l°/o) 


0.2 



Abbreviations: AS = active surveillance; CAPRA = Cancer of the Prostate Risk 
Assessment Post-Surgical Score; GS = Gleason sum; PRIAS = Prostate Cancer 
Research International: Active Surveillance. a Risk group patterned on D'Amico 
system with pT and Gleason sum from radical prostatectomy (instead of cT and 
biopsy Gleason sum). Risk groups 0-2 = low; 3-5 = intermediate; ^6 = high. 



the American (UCSF, h = 331 (Conti et al, 2009)) and European 
(Milan and Hamburg, n = 2455 and Milan, n = 85 (Suardi et al, 
2008; Suardi et al, 2010)) cohorts, where the Toronto and PRIAS 
inclusion criteria were also used to select patients from a RP 
database. Applying the Toronto criteria, we found a higher than 
previously reported rate of Gleason score upgrading (50.6%) 
compared with Conti et al 31% and Suardi et al 38.1%. However, 
similar to previous reports, the majority of GS upgrading was 3 + 4 
disease (84%) and this may not translate into a major clinical 
problem. The rate of upstaging (EPE/SVI) when using the Toronto 
criteria (17%) was similar to previous reports in the literature 
(14-15%). There was less upstaging and upgrading when using the 
stricter PRIAS criteria; however, our reclassification rates are 
markedly higher (upgrading 42.7% and upstaging 12.7%) com- 
pared with those previously reported by the Milan group 
(upgrading 7.1% and upstaging 2.4%; Suardi et al, 2008). The 
results of our subanalysis, by year (^2003) and number of biopsy 
cores taken (^8), demonstrated similar predictors to those found 
for the entire cohort. This was not surprising given that the two 
cohorts with high rates of upstaging (Cambridge) and upgrading 
(Melbourne) were the most contemporary with higher median 
number of cores taken (Tables 2a and b). 

Individually, each of our three centres has a relatively high rate 
of GS upgrading (Table 4, Cambridge 44%, Vancouver 47%, 
Melbourne 62%). Possible reasons for this include sampling error 
on biopsy, interobserver pathology variation between biopsy and 
RP reports and differing geographic population patterns of disease. 
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Table 4 Data (combined three centres) restricted to year ^2003 and 
total number of biopsy cores ^8 



Toronto criteria PRIAS criteria P 



Years of data 
N (total AS) 


2003-2010 
644 


2003-2010 
310 




Age (years) 
Median (IQR) 


61 (57-65) 


61 (57-65) 


0.55 


PSA (ngmr 1 ) 
Median (IQR) 


5.8 (4.7-7.4) 


5.6 (4.4-7.0) 


0.018 


Clinical stage 
cTI 

cT2a/cT2 


A / ~ 7 f~l"~\ ro/\ 

467 (72.5%) 
177 (27.5%) 


•""l T^l /~ 7 a r>o/\ 

232 (74.8%) 
78 (25.2%) 


0.45 


PSA density 0 
Median (IQR) 


0.1 1 1 (0.082-0.147) 


0.098 (0.07-0. 1 27) 


< 0.001 


Biopsy cores taken 
Median (range) 


10 (8-12) 


10 (8-12) 




Number of positive cores 
Median (IQR) 


2(1-4) 


1 (1-2) 


0.48 


Centre (n, %) 
Cambridge 
Vancouver 
Melbourne 


267 (41.5%) 
190 (29%) 
1 87 (29%) 


1 25 (40.3%) 
87 (28.1%) 
98 (3 1 .6%) 


0.8 


Prostatectomy Gleason score 
7 

3 + 4 

4 + 3 
8-10 


3 1 1 (48.3%) 
3 1 8 (49.4%) 

-) ~7 "i /or no/ \ 

273 (85.8%) 
45 (14.2%) 
1 5 (2.3%) 


176 (56.8%) 
127 (41%) 
102 (80.3%) 
25 (19.7%) 
7 (2.2%) 


0.24 


pT 

pT2 
pT3/4 


529 (82.1%) 
1 15 (17.9%) 


275 (88.7%) 
35 (1 1.3%) 


0.024 


EPE 
No 

Yes 


533 (82.8%) 
1 1 1 (17.2%) 


277 (89.4%) 
33 (10.6%) 


0.008 


SVI 

No 
Yes 


637 (98.9%) 
7 (1.1%) 


307 (99%) 
3(1%) 


0.87 



Abbreviations: AS = active surveillance; IQR = interquartile range; PRIAS = Prostate 
Cancer Research International: Active Surveillance; PSA = prostate-specific antigen. 
PSA density a = not part of original Klotz criteria but reported for comparison to Van 
den Bergh. 



The number of biopsy cores taken for each centre was: Cambridge 
(median = 12, IQR10-12), Melbourne (median = 10, IQR8-13) and 
Vancouver (median = 8, IQR8-8). The standard extended core 
template for prostate biopsy consists of 10-12 cores, and hence 
particularly in Vancouver biopsy under- sampling could affect our 
results. To minimise biopsy sampling error, Adamy et al (2011) 
suggest immediate confirmatory biopsy before commencing AS. 
There may be inter-observer variation influencing the Melbourne 
data as ~50% of biopsies are reported by community pathologists, 
whereas all RP specimens are read by two specialist uro- 
pathologists. In addition, most of the upgrading was predomi- 
nantly Gleason 3 + 4 disease (84% of those upgraded (Suardi et al, 
2008)). Results for carefully selected men on AS with intermediate- 
risk disease (Gleason sum 7 or CAPRA score 3-5) have been 
published suggesting that within limited follow-up (4years) 
outcomes were similar to men with GS 3 + 3 disease (Cooperberg 
et al, 2011). 
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Table 5a Predictors of high-risk( a ) disease for combined three centres, 
the Toronto and PRIAS selection criteria for AS groups 



Toronto AS criteria PRIAS criteria 



OR 95% CIs P OR 95% CIs P 



Age 


1.04 


1 .01-1.07 


0.02 


1.02 


0.97- 


-1.07 


0.47 


PSA 


1.08 


0.98-1.19 


0.1 


1.15 


1.01- 


-1.31 


0.043 


cT 


1.54 


1.01-2.34 


0.045 


1.85 


1.03- 


-3.32 


0.04 


Total no. of cores taken 


0.94 


0.88-1.01 


0.085 


0.95 


0.87- 


-1.04 


0.25 


No. of positive cores 


1.25 


1. 14-1. 37 


< 0.001 


0.91 


0.51- 


-1.61 


0.75 


Centre 
















Vancouver 


1 






1 








Cambridge 


2.85 


1 .68-4.84 


< 0.001 


1.55 


0.71- 


-3.38 


0.27 


Melbourne 


1.03 


0.59-1.79 


0.91 


0.76 


0.34- 


-1.7 


0.51 


Table 5b Predictors of high-risk( a ) disease for the Cambrid; 


ge cohort, 


Toronto AS selection criteria. 
















OR 




95% CIs 






P 


Age 


1.06 


1 




1. 


13 




0.049 


PSAD 


1.71 


1.03 


2.83 




0.037 


Ct 


2.71 


1.2 




6.12 




0.017 


Positive cores 


1.24 


I.Oi 


6 


1.46 




0.008 



Abbreviations: AS = active surveillance; CI = confidence interval; OR = odds ratio; 
PRIAS = Prostate Cancer Research International: Active Surveillance; PSAD = PSA 
density. a High-risk disease defined as ^pT3 and/or Gleason sum ^8. 



Cambridge had a high proportion of EPE in its Toronto criteria 
group (26% vs 12%, P< 0.001) compared with the other two 
centres. As pathology for biopsy and RP is centrally reviewed in 
Cambridge and an extended template used for biopsy, this high 
rate of pT3a disease could also be attributable to a low uptake of 
PSA testing in the United Kingdom. Melia (2005) compared 
worldwide rates of PSA testing in 2005 and, allowing for lack of 
standardized data, found rates of PSA testing in the United 
Kingdom considerably lower than elsewhere. At a similar period of 
time (year 2000), the rate of PSA testing in the United Kingdom 
was 5.4 tests per 100 men per annum (men 45-84 years old) 
compared with the United States, 38% of black men and 31% of 
white men (^65 years); Italy, 26.9% men (^40 years); Australia, 
23% men (40-70 years); and Canada (Beaulac et al, 2006), 47.5% 
(^50 years). 

The few previously published predictive tools for AS selection 
have focused on calculating likelihood of indolent, low-volume/ 
low-grade or insignificant PCa rather than high-risk features 
(Kattan et al, 2003; Ochiai et al, 2005; Nakanishi et al, 2007; 
Steyerberg et al, 2007; O'Brien et al, 2011). Nakanishi et aVs 
nomogram, using a cohort of 258 men from the Canada and the 
United States, is specific for men with a single-positive biopsy core 
and uses age, PSAD and tumour length in a core to predict 
indolent cancer (Nakanishi et al, 2007). Possibly having a single 
positive core is too stringent a criterion for a programme of AS. 
Kattan et al 's nomogram to predict small, moderately differ- 
entiated, confined tumours (Kattan et al, 2003) that was recently 
validated and updated (Steyerberg et al, 2007); however, its 
application is questionable as its data are based on sextant biopsies 
with most centres now performing 10-12 core biopsies. 
A nomogram generated from an Australian cohort was recently 
published (O'Brien et al, 2011), and for multiple probability cutoffs 
predicting indolent PCa, they gave coexisting rates of high-risk 
disease. It is likely that in the future men will be selected for AS 
programmes only if they have an mp-MRI that does not 
demonstrate any other unsuspected cancers, possibly coupled 
with a limited template biopsy. The result of the PIVOT trial (Wilt 
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37.0 50.0 60.0 73.0 



PSA density i 



i — i — i — . — i — i — i — i — i — i — i — i — i — ■ — i — i — i — | — i — i—i 
0.0 0.075 0.15 0.225 0.3 0.375 0.45 0.525 0.6 0.675 0.75 0.82 



Pos cores 



Total 



Probability 



0.0 1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0 9.0 10.0 11.5 14.0 



60 80 100 120 140 16 
0.3 0.5 0.7 



180 200 220 



0.8 0.9 0.95 



Figure I Nomogram to predict individual probability of high-risk PCa in 
UK men who meet the Klotz criteria. For a patient, each criteria (age, 
PSAD, number of positive cores and cT stage) translates to a number of 
points - read off the top line. The total number of points then corresponds 
to a probability of finding high-risk PCa if RP is performed. 
Logistic regression equation for nomogram. Log(p/ 1 -p) = — 7.05 I + 0.059( 
age in years) + 0.537(PSAD in units of 0. 1 ng ml cm 5 ) + 0.996(cT) +0.2 1 8 
(number of positive cores). 



ROC curve 




0.0 0.2 0.4 0.6 0.8 1.0 
1 - specificity 

Figure 2 Receiver operating characteristic curve to assess performance 
of nomogram predicting probability of high-risk PCa in UK men satisfying 
Klotz AS criteria. 




0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 
Predicted 

Figure 3 Calibration plot comparing nomogram-predicted probabilities 
to actual proportions of high-risk disease. 

ideal nomogram would also include information on previous 
biopsies, family history of significant CaP and results of MRI 
imaging; however, this was not present in our data set. Given the 
low rate of PSA testing in the United Kingdom and the high rate of 
upstaging in our Cambridge cohort, our nomogram would be a 
reasonable tool for counselling UK patients in regards to AS. 

Our study has limitations. Being retrospective, data collected 
were reliant on individual centres' protocols and they did not 
include all information on tumour volume. The use of additional 
criteria such as length or percentage of a single core involved 
might reduce the amount of reclassification, but would likely 
reduce the number of patients to whom AS could be offered. Data 
on lymph node status were lacking as lymph node dissection for 
low-risk disease was according to surgeon preference, and not 
consistently performed or recorded. Lack of follow-up data for 
cancer recurrence or death is significant, as having pathological 
features of advanced disease on biopsy does not necessarily 
translate to poorer outcomes after surgery. Using a surgical cohort 
includes unforeseen biases in patient selection not addressed in AS 
criteria, such as age, comorbidities, family history, patient anxiety 
for intervention, and findings on imaging or institutional bias 
towards type of treatment. The median age of our cohort (61 years) 
is younger than that reported by AS cohorts (65 years; Carter et al, 
2007; van den Bergh et al, 2009). Being a multicentre study, 
multiple pathologists reported biopsies and specimens, and the 
effect of inter- observer variation was not calculated. We also 
accept that, given that our study spanned a broad period of time, 
interpretive changes in pathological grading and clinical staging of 
PCa did occur (Thompson et al, 2005). Multiple surgeons 
performed the surgery though positive margin rates that were 
similar. 



et al, 2012) has demonstrated that for low- risk men, surgery offers 
limited advantages over watchful waiting, and AS programmes are 
likely to be at least as effective. 

To our knowledge, there has not been a nomogram derived from 
the British data to assist with selection of patients for AS. From our 
Cambridge data, we generated a nomogram that predicts presence 
of high-risk disease in patients who satisfy the Toronto entry 
criteria for AS. Overall assessment of the performance of our 
nomogram was good (AUC 0.72 and two-component Brier score 
0.199). However, the calibration plot suggests that our nomogram 
consistently underestimates the observed proportion of high-risk 
disease for nearly all predicted values. Unsurprisingly, the 
nomogram performed poorer when we used the Melbourne 
(AUC 0.68) and Vancouver (0.55) data to externally validate it. 
There is evidence that risk calculators are best applied to the 
population from which they are generated (Bhojani et al, 2009). An 



CONCLUSIONS 

Our study examined the rates of reclassification of men from 
Australia, Britain and Canada who underwent RP who initially 
would have been suitable for AS, as defined by the Toronto and 
PRIAS criteria. Compared with previously reported cohorts from 
Europe and North America using the same AS selection criteria, we 
found significantly higher rates of upgrading and upstaging. 

Care must be used when applying AS criteria generated from 
one population to another distinct population. There is an onus on 
larger centres in individual countries to assess the performance of 
different criteria on their population before implementation in 
routine AS programmes and generate predictive tools from their 
own data sets. Use of increasingly stringent selection criteria may 
reduce reclassification but this must be balanced against the 
exclusion of a significant number of men from AS who may benefit 
from such an approach. 
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The acceptability of AS protocols would best be evaluated by 
ongoing prospective studies. The development of novel serum or 
tissue markers, and improved imaging to predict disease progres- 
sion would help remove any uncertainties physicians and patients 
have with AS. 
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